NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Scalable spectral representations for multi-agent reinforcement learning in network MDPs

Ren, Zhaolin; Zhang, Runyu; Dai, Bo; Li, Na (May 2025, AIstats, PMLR)

Full Text Available
Scalable spectral representations for multiagent reinforcement learning in network MDPs

Ren, Zhaolin; Zhang, Runyu; Dai, Bo; Li, Na (May 2025, PMLR)
Li, Yingzhen; Mandt, Stephan; Agrawal, Shipra; Khan, Emtiyaz (Ed.)
Full Text Available
Scalable spectral representations for multiagent reinforcement learning in network MDPs

Ren, Zhaolin; Zhang, Runyu; Dai, Bo; Li, Na (May 2025, Proceedings of Machine Learning Research)
Li, Yingzhen; Mandt, Stephan; Agrawal, Shipra; Khan, Emtiyaz (Ed.)
Network Markov Decision Processes (MDPs), which are the de-facto model for multi-agent control, pose a significant challenge to efficient learning caused by the exponential growth of the global state-action space with the number of agents. In this work, utilizing the exponential decay property of network dynamics, we first derive scalable spectral local representations for multiagent reinforcement learning in network MDPs, which induces a network linear subspace for the local $$Q$$-function of each agent. Building on these local spectral representations, we design a scalable algorithmic framework for multiagent reinforcement learning in continuous state-action network MDPs, and provide end-to-end guarantees for the convergence of our algorithm. Empirically, we validate the effectiveness of our scalable representation-based approach on two benchmark problems, and demonstrate the advantages of our approach over generic function approximation approaches to representing the local $$Q$$-functions.
more » « less
Full Text Available
Stochastic Nonlinear Control via Finite-Dimensional Spectral Dynamics Embedding

https://doi.org/10.1109/TAC.2025.3580296

Ren, Zhaolin; Ren, Tongzheng; Ma, Haitong; Li, Na; Dai, Bo (December 2025, IEEE Transactions on Automatic Control)

Full Text Available
Offline Imitation Learning upon Arbitrary Demonstrations by Pre-Training Dynamics Representations

https://doi.org/10.1109/IROS60139.2025.11246167

Ma, Haitong; Dai, Bo; Ren, Zhaolin; Wang, Yebin; Li, Na (October 2025, IEEE)

Full Text Available
Skill Transfer and Discovery for Sim-to-Real Learning: A Representation-Based Viewpoint

https://doi.org/10.1109/IROS58592.2024.10801637

Ma, Haitong; Ren, Zhaolin; Dai, Bo; Li, Na (October 2024, IEEE)

Full Text Available
Distributed Thompson Sampling Under Constrained Communication

https://doi.org/10.1109/LCSYS.2024.3525096

Zerefa, Saba; Ren, Zhaolin; Ma, Haitong; Li, Na (January 2024, IEEE Control Systems Letters)

Full Text Available
On Controller Reduction in Linear Quadratic Gaussian Control with Performance Bounds

Ren, Zhaolin; Zheng, Yang; Fazel, Maryam; Li, Na (June 2023, Proceedings of Machine Learning Research)

The problem of controller reduction has a rich history in control theory. Yet, many questions remain open. In particular, there exist very few results on the order reduction of general non-observer based controllers and the subsequent quantification of the closed-loop performance. Recent developments in model-free policy optimization for Linear Quadratic Gaussian (LQG) control have highlighted the importance of this question. In this paper, we first propose a new set of sufficient conditions ensuring that a perturbed controller remains internally stabilizing. Based on this result, we illustrate how to perform order reduction of general (non-observer based) output feedback controllers using balanced truncation and modal truncation. We also provide explicit bounds on the LQG performance of the reduced-order controller. Furthermore, for single-input-single-output (SISO) systems, we introduce a new controller reduction technique by truncating unstable modes. We illustrate our theoretical results with numerical simulations. Our results will serve as valuable tools to design direct policy search algorithms for control problems with partial observations.
more » « less
Full Text Available
On Controller Reduction in Linear Quadratic Gaussian Control with Performance Bounds

Ren, Zhaolin; Zheng, Yang; Fazel, Maryam; Li, Na (June 2023, Learning for Dynamics and Control Conference)

Full Text Available
Zeroth-order feedback optimization for cooperative multi-agent systems

https://doi.org/10.1016/j.automatica.2022.110741

Tang, Yujie; Ren, Zhaolin; Li, Na (February 2023, Automatica)

Full Text Available

« Prev Next »

Search for: All records